NSF PAR Search | NSF Public Access Repository

Note: When clicking on a Digital Object Identifier (DOI) number, you will be taken to an external site maintained by the publisher. Some full text articles may not yet be available without a charge during the embargo (administrative interval).
What is a DOI Number?

Some links on this page may take you to non-federal websites. Their policies may differ from this site.

Towards generalizable and interpretable three-dimensional tracking with inverse neural rendering

https://doi.org/10.1038/s42256-025-01083-x

Ost, Julian; Banerjee, Tanushree; Bijelic, Mario; Heide, Felix (August 2025, Nature Machine Intelligence)

Free, publicly-accessible full text available August 1, 2026
Beating spectral bandwidth limits for large aperture broadband nano-optics

https://doi.org/10.1038/s41467-025-58208-4

Fröch, Johannes E; Chakravarthula, Praneeth; Sun, Jipeng; Tseng, Ethan; Colburn, Shane; Zhan, Alan; Miller, Forrest; Wirth-Singh, Anna; Tanguy, Quentin_A A; Han, Zheyi; et al (December 2025, Nature Communications)

Free, publicly-accessible full text available December 1, 2026
Computational imaging with meta-optics

https://doi.org/10.1364/OPTICA.546382

Fröch, Johannes_E; Colburn, Shane; Brady, David_J; Heide, Felix; Veeraraghavan, Ashok; Majumdar, Arka (May 2025, Optica)

Sub-wavelength diffractive meta-optics have emerged as a versatile platform to manipulate light fields at will, due to their ultra-small form factor and flexible multifunctionalities. However, miniaturization and multimodality are typically compromised by a reduction in imaging performance; thus, meta-optics often yield lower resolution and stronger aberration compared to traditional refractive optics. Concurrently, computational approaches have become popular to improve the image quality of traditional cameras and exceed limitations posed by refractive lenses. This in turn often comes at the expense of higher power and latency, and such systems are typically limited by the availability of certain refractive optics. Limitations in both fields have thus sparked cross-disciplinary efforts to not only overcome these roadblocks but also to go beyond and provide synergistic meta-optical–digital solutions that surpass the potential of the individual components. For instance, an application-specific meta-optical frontend can preprocess the light field of a scene and focus it onto the sensor with a desired encoding, which can either ease the computational load on the digital backend or can intentionally alleviate certain meta-optical aberrations. In this review, we introduce the fundamentals, summarize the development of meta-optical computational imaging, focus on latest advancements that redefine the current state of the art, and give a perspective on research directions that leverage the full potential of sub-wavelength photonic platforms in imaging and sensing applications. The current advancement of meta-optics and recent investments by foundries and technology partners have the potential to provide synergistic future solutions for highly efficient, compact, and low-power imaging systems.
more » « less
Spatially varying nanophotonic neural networks

https://doi.org/10.1126/sciadv.adp0391

Wei, Kaixuan; Li, Xiao; Froech, Johannes; Chakravarthula, Praneeth; Whitehead, James; Tseng, Ethan; Majumdar, Arka; Heide, Felix (November 2024, Science Advances)

The explosive growth in computation and energy cost of artificial intelligence has spurred interest in alternative computing modalities to conventional electronic processors. Photonic processors, which use photons instead of electrons, promise optical neural networks with ultralow latency and power consumption. However, existing optical neural networks, limited by their designs, have not achieved the recognition accuracy of modern electronic neural networks. In this work, we bridge this gap by embedding parallelized optical computation into flat camera optics that perform neural network computations during capture, before recording on the sensor. We leverage large kernels and propose a spatially varying convolutional network learned through a low-dimensional reparameterization. We instantiate this network inside the camera lens with a nanophotonic array with angle-dependent responses. Combined with a lightweight electronic back-end of about 2K parameters, our reconfigurable nanophotonic neural network achieves 72.76% accuracy on CIFAR-10, surpassing AlexNet (72.64%), and advancing optical neural networks into the deep learning era.
more » « less
Full Text Available
∇-Prox: Differentiable Proximal Algorithm Modeling for Large-Scale Optimization

https://doi.org/10.1145/3592144

Lai, Zeqiang; Wei, Kaixuan; Fu, Ying; Härtel, Philipp; Heide, Felix (August 2023, ACM Transactions on Graphics)

Tasks across diverse application domains can be posed as large-scale optimization problems, these include graphics, vision, machine learning, imaging, health, scheduling, planning, and energy system forecasting. Independently of the application domain, proximal algorithms have emerged as a formal optimization method that successfully solves a wide array of existing problems, often exploiting problem-specific structures in the optimization. Although model-based formal optimization provides a principled approach to problem modeling with convergence guarantees, at first glance, this seems to be at odds with black-box deep learning methods. A recent line of work shows that, when combined with learning-based ingredients, model-based optimization methods are effective, interpretable, and allow for generalization to a wide spectrum of applications with little or no extra training data. However, experimenting with such hybrid approaches for different tasks by hand requires domain expertise in both proximal optimization and deep learning, which is often error-prone and time-consuming. Moreover, naively unrolling these iterative methods produces lengthy compute graphs, which when differentiated via autograd techniques results in exploding memory consumption, making batch-based training challenging. In this work, we introduce ∇-Prox, a domain-specific modeling language and compiler for large-scale optimization problems using differentiable proximal algorithms. ∇-Prox allows users to specify optimization objective functions of unknowns concisely at a high level, and intelligently compiles the problem into compute and memory-efficient differentiable solvers. One of the core features of ∇-Prox is its full differentiability, which supports hybrid model- and learning-based solvers integrating proximal optimization with neural network pipelines. Example applications of this methodology include learning-based priors and/or sample-dependent inner-loop optimization schedulers, learned with deep equilibrium learning or deep reinforcement learning. With a few lines of code, we show ∇-Prox can generate performant solvers for a range of image optimization problems, including end-to-end computational optics, image deraining, and compressive magnetic resonance imaging. We also demonstrate ∇-Prox can be used in a completely orthogonal application domain of energy system planning, an essential task in the energy crisis and the clean energy transition, where it outperforms state-of-the-art CVXPY and commercial Gurobi solvers.
more » « less
Full Text Available
Single Depth-image 3D Reflection Symmetry and Shape Prediction

https://doi.org/10.1109/ICCV51070.2023.00817

Zhang, Zhaoxuan; Dong, Bo; Li, Tong; Heide, Felix; Peers, Pieter; Yin, Baocai; Yang, Xin (October 2023, IEEE)

Full Text Available
Multi-view Spectral Polarization Propagation for Video Glass Segmentation

https://doi.org/10.1109/ICCV51070.2023.02122

Qiao, Yu; Dong, Bo; Jin, Ao; Fu, Yu; Baek, Seung-Hwan; Heide, Felix; Peers, Pieter; Wei, Xiaopeng; Yang, Xin (October 2023, IEEE)

Full Text Available
Stochastic Light Field Holography

https://doi.org/10.1109/ICCP56744.2023.10233716

Schiffers, Florian; Chakravarthula, Praneeth; Matsuda, Nathan; Kuo, Grace; Tseng, Ethan; Lanman, Douglas; Heide, Felix; Cossairt, Oliver (July 2023, 2023 IEEE International Conference on Computational Photography (ICCP))

The Visual Turing Test is the ultimate goal to evaluate the realism of holographic displays. Previous studies have focused on addressing challenges such as limited e ́tendue and image quality over a large focal volume, but they have not investigated the effect of pupil sampling on the viewing experience in full 3D holograms. In this work, we tackle this problem with a novel hologram generation algorithm motivated by matching the projection operators of incoherent (Light Field) and coherent (Wigner Function) light transport. To this end, we supervise hologram computation using synthesized photographs, which are rendered on-the-fly using Light Field refocusing from stochastically sampled pupil states during optimization. The proposed method produces holograms with correct parallax and focus cues, which are important for passing the Visual Turing Test. We validate that our approach compares favorably to state-of-the-art CGH algorithms that use Light Field and Focal Stack supervision. Our experiments demonstrate that our algorithm improves the viewing experience when evaluated under a large variety of different pupil states.
more » « less
Full Text Available
In the Blink of an Eye: Event-based Emotion Recognition

https://doi.org/10.1145/3588432.3591511

Zhang, Haiwei; Zhang, Jiqing; Dong, Bo; Peers, Pieter; Wu, Wenwei; Wei, Xiaopeng; Heide, Felix; Yang, Xin (July 2023, ACM)

Full Text Available
Hogel-Free Holography

https://doi.org/10.1145/3516428

Chakravarthula, Praneeth; Tseng, Ethan; Fuchs, Henry; Heide, Felix (October 2022, ACM Transactions on Graphics)

Holography is a promising avenue for high-quality displays without requiring bulky, complex optical systems. While recent work has demonstrated accurate hologram generation of 2D scenes, high-quality holographic projections of 3D scenes has been out of reach until now. Existing multiplane 3D holography approaches fail to model wavefronts in the presence of partial occlusion while holographic stereogram methods have to make a fundamental tradeoff between spatial and angular resolution. In addition, existing 3D holographic display methods rely on heuristic encoding of complex amplitude into phase-only pixels which results in holograms with severe artifacts. Fundamental limitations of the input representation, wavefront modeling, and optimization methods prohibit artifact-free 3D holographic projections in today’s displays. To lift these limitations, we introduce hogel-free holography which optimizes for true 3D holograms, supporting both depth- and view-dependent effects for the first time. Our approach overcomes the fundamental spatio-angular resolution tradeoff typical to stereogram approaches. Moreover, it avoids heuristic encoding schemes to achieve high image fidelity over a 3D volume. We validate that the proposed method achieves 10 dB PSNR improvement on simulated holographic reconstructions. We also validate our approach on an experimental prototype with accurate parallax and depth focus effects.
more » « less
Full Text Available

« Prev Next »

Search for: All records